Endogenous gene expression modification with regulatory
element.



FIELD OF INVENTION

The present invention relates to a process for
the modification of the expression characteristics of a
gene which is naturally present within the genome of a
stable cell line or cloned microorganism. In the preferred
embodiment, the present invention relates to a process for
the activation and expression of a gene that is present
within a stable cell line and normally transcriptionally
silent or inert. As a result, the protein product of that
gene is expressed. This phenomenon occurs without
transfecting the cell with the DNA that encodes the
product. Rather, the resident gene coding for the desired
product is identified within a cell and activated by
inserting an appropriate regulatory segment through a
technique called homologous recombination. Positive and/or
negative selectable markers can also be inserted to aid in
selection of the cells in which proper homologous
recombination events have occurred. As an additional
embodiment, a specified gene can be amplified for enhanced
expression rates, whether that gene is normally
transcriptionally silent and has been activated by means of
the present invention, or endogenously expresses product.

BACKGROUND OF THE INVENTION 

It is well known that each cell within an
organism contains the genetic information that encodes all
of the proteins found within that organism. However, only
a very small percentage of the genes present within a given
cell type is actually transcribed. The intracellular
mechanisms that regulate the array of genes to be
transcribed are now understood. Cell specific proteins
present within the nucleus interact with DNA regulatory
segments that are linked with particular genes. This
interaction of nuclear proteins with DNA regulatory
sequences is required for gene transcription. This results
in mRNA biosynthesis and ultimate expression of the encoded
protein (Mitchell and Tjian, Science, 245:371, 1989).

These DNA regulatory segments or elements for
each gene lie upstream from and, in some cases, within or
even downstream of the coding regions. Through an
interaction with cell specific nuclear proteins, DNA
regulatory segments affect the ability of RNA polymerase,
the rate limiting enzyme in protein expression, to gain
access to the body of the gene and synthesize an mRNA
transcript. Thus, these DNA segments and the resident
nuclear proteins play a critical role in the regulation of
expression of specific genes (Johnson and McKnight, Ann.
Rev. Biochem., 58:799, 1989).

The DNA regulatory segments are binding sites for
the nuclear proteins. These nuclear proteins attach to the
DNA helix and apparently alter its structure to make the
desired gene available for RNA polymerase recognition,
which facilitates gene transcription. The expression of
these cell specific regulatory proteins determines which
genes will be transcribed within a cell and the rate at
which this expression will occur. As an example of the
specificity of this system, pituitary cells but not liver
cells express pituitary proteins, even though the genes for
the pituitary proteins are present within all liver cells.
Nuclei of the liver cells do not contain the specific DNA
binding proteins which interact with the elements of
pituitary genes resident within the liver cells.

Current Methods Employed to Express Proteins Using
Recombinant DNA Technology

With the knowledge that specific DNA regulatory
sequences are required to activate gene transcription
within a particular cell type, scientists have expressed
foreign genes within a particular cell type through genetic
engineering. In general, DNA regulatory segments that are
recognized by the cell's nuclear proteins are placed
upstream from the coding region of a foreign gene to be
expressed. In this way, after insertion into the cell,
foreign DNA may be expressed since the cell's nuclear
regulatory proteins now recognize these DNA regulatory
sequences. This technology has been employed to produce
proteins that have been difficult to obtain or purify from
natural sources by traditional purification strategies.

In addition to the recognizable DNA sequences and
the gene of interest, a selectable marker is attached to
the DNA construction. In this way, only the cells that
have taken up the DNA survive following culture in a
selectable medium. For example, the gene for neomycin
resistance may be included in the expression vector.
Following transfection, cells are cultured in G418, a
neomycin antibiotic that is lethal to mammalian cells. If,
however, the cells have acquired the neomycin resistance
gene, they will be able to withstand the toxic effects of
the drug. In this way, only the cells that have taken up
the transfected DNA are maintained in culture. It is
understood that any selectable marker could be used as long
as it provided for selection of cells that had taken up the
transfected DNA. It is further understood that there is no
criticality as to the specific location of the inserted
genetic material within the cell. It is only important that
it be taken up somewhere within the nucleus as both the
regulatory segment and the foreign gene (as well as the
selectable marker) are inserted together.


Deficiencies in the Current Methods of Gene Expression

While the above techniques have been instrumental
in exploiting the power of genetic engineering, they have
not always been the most efficient methods to express
genes. This is due to the fact that insertion of DNA into
the nucleus of a cell line is usually accomplished through
a technique known as transfection. DNA that has been
engineered for expression in the cell line of interest is
precipitated and the cell membrane is solubilized to allow
entry of the DNA. As indicated above, the exact site into
which the DNA incorporates into the genome is never
predictable; indeed the DNA may remain episomal (not
integrated into the genome). This results in the
unpredictability of both the level of expression of the
protein produced and the stability of the cell line.

A second shortcoming of this technique is the
fact that the construction of the expression vector is
extremely difficult when the gene of interest is relatively
large (greater than 5-10 kilobases). Many of the proteins
expressed by recombinant DNA technology have been encoded
by cDNAs rather than much larger genomic clones. This is
done to reduce the overall size of the insert. While the
use of cDNAs makes genetic engineering more convenient,
rates of gene transcription and protein production may
suffer as a result. It has recently been shown that
expression levels are sometimes greatly enhanced through
the use of genomic rather than cDNA inserts (Brinster et
al., Proc. Natl. Acad. Sci., 85:836-840, 1988, and Chung
and Perry, Mol. Cell. Biol., 9:2075-2082, 1989).
Although the mechanisms responsible for this observation
are not well understood, it is known that in certain
situations enhancer elements present within introns can
improve the transcriptional efficiency of the gene. There
is also evidence that introns, or the splicing events which
result from the presence of introns, may have an effect on
the RNA processing events which follow the initiation of
transcription (Buchman and Berg, Mol. Cell. Biol.,
8:4395-4405, 1988). This may stabilize the transcript thereby
improving the rate of mRNA accumulation. In the above
cited Brinster et al. paper, it is also postulated that the
position of the introns within the gene may be important
for phasing of nucleosomes relative to the promoter. The
influence of various regulatory elements on transcription
of eukaryotic genes is discussed in Khoury et al., Cell,
33:313-14 (1983), Maniatis et al., Science, 236:1237-45
(1987) and Muller et al., Eur. J. Biochem., 176:485-95
(1988).

Thirdly, to gain entry into the nucleus, the
transfected DNA, including the entire coding region of the
foreign gene, must traverse the cytoplasm following entry
through the permeabilized plasma membrane of the cell.
During that time, the DNA may come in contact with
lysosomal enzymes which may alter or completely destroy the
integrity of the DNA. Thus, the coding region of the DNA
may not be identical to that which was transfected.

The novel method of gene activation and/or
expression modification that we describe below cannot
result in the production of mutant forms of the desired
protein, since the coding region of the desired gene is not
subjected to enzymatic modifications.

In summary, a large amount of the DNA transfected
into the cell using traditional techniques, and
particularly the coding region thereof, will not be
faithfully transcribed. It may be degraded prior to entry
into the nucleus, enzymatically perturbed so that it will
not encode the entire desired protein, or it may not contain
all of the necessary regulatory segments to allow for
transcription. It may be inserted into a portion of the
genome that prevents transcription. If the cDNA is
transcribed, the protein of interest may not be produced
efficiently due to the omission of introns which may
contain enhancers or enable efficient mRNA processing.
Finally, it may remain episomal, promote protein production
but be unstable as the cell population grows through cell
division.

It would be most desirable to develop a method of
induction of gene expression that would produce a cell line
that has incorporated the positive attributes of the
existing methods but somehow circumvents the unattractive
features. It would further be desirable to be able to
express or modify endogenous expression of particular genes
in the cell type of choice. It is further desired to be
able to take advantage of the potential benefits that may
be afforded by a complete genomic sequence which may
include cryptic transcriptional enhancers that may reside
within introns, by appropriate placement of introns for
proper nucleosome phasing or by more efficient mRNA
processing events. These advantages are ordinarily not
enjoyed in recombinant DNA expression methods due to the
size of the gene. If one were able to express a gene that
is already resident in the genome, i.e., an endogenous
gene, cell line stability and expression rates would become
more consistent and predictable.

SUMMARY OF THE INVENTION
Accordingly, it is an object of the present 
invention to eliminate the above-noted deficiencies in the 
prior art. 

It is another object of the present invention to
provide a method of regulation and/or amplification of gene
expression that incorporates the positive attributes of
recombinant gene technology but circumvents the
unattractive features.

It is a further object of the present invention
to provide a method for expressing specific genes present
but normally transcriptionally silent in a cell line of
choice.

It is yet a further object of the present
invention to provide a method for expressing proteins which
takes full advantage of complete genomic sequences that are
responsible for mRNA accumulation and/or transcription.

It is still another object of the present
invention to provide a method of modifying the expression
characteristics of a gene of interest by inserting DNA
regulatory segments and/or amplifying segments into the
genome of a stable cell line or cloned microorganism
upstream of, within, or otherwise proximal to the native
gene of interest.

It is still a further object of the present
invention to provide a method for modifying the expression
characteristics of a gene which is naturally present within
the genome of a stable cell line or cloned microorganism
and at the same time insert characteristics which will aid
in the selection of cells which have been properly
modified.

It is yet another object of the present invention
to provide a genome having therein, proximal to the coding
region or exons of a gene of interest, a regulatory or
amplifying segment which does not naturally appear
thereat.

It is another object of the present invention to 
provide DNA constructs which can be used for accomplishing 
the homologous recombination methods of the present 
invention. 

It is a further object of the present invention
to provide cell lines and microorganisms which include the
genomes in accordance with the present invention.

These and other objects of the present invention
are accomplished by means of the technique of homologous
recombination, by which one of ordinary skill in this art
can cause the expression and, preferably, amplification of
resident, albeit transcriptionally silent, genes. By this
technique, one can also modify the expression
characteristics of a gene which is naturally present, but
not necessarily silent or inert, within the genome of a
stable cell line, such as, for example, to make the
expression conditional, i.e., repressible or inducible, or
to enhance the rate of expression.

The present invention provides a method of
modifying the expression characteristics of a gene within
the genome of a cell line or microorganism. A DNA
construct is inserted into that genome by the technique of
homologous recombination. The construct includes a DNA
regulatory segment capable of modifying the expression
characteristics of any gene to which it is operatively
linked within the host cell line or microorganism, as well
as a targeting segment homologous to a region of the genome
at which it is desired for the DNA regulatory segment to be
inserted. The construct and insertion technique are
designed to cause the new DNA regulatory segment to be
operatively linked to the gene of interest. Thus, without
necessarily inserting any new coding exons, the expression
characteristics of that gene are modified. In the
preferred embodiment, the gene is one which is normally
transcriptionally silent or inert within the host cell line
or microorganism and, by means of the DNA regulatory
region, which is targeted directly to the appropriate
position with respect to that gene by means of homologous
recombination, that gene is thereby activated for
expression of its gene product.

The DNA construct preferably includes two
targeting segments which, while separated from one another
in the construct by those elements to be inserted into the
genome, are preferably contiguous in the native gene.

The construct further preferably includes at
least one expressible selectable marker gene, such as the
gene providing neomycin resistance. This marker gene,
including a promoter therefor, is also disposed between the
two targeting regions of the construct.

In another embodiment, the construct includes an
expressible amplifiable gene in order to amplify expression
of the gene of interest. This gene, including a promoter
therefor, is also disposed between the two targeting
regions of the construct. In some cases the selectable
marker and the amplifiable marker may be the same.

In a further embodiment of the present invention,
the DNA construct includes a negative selectable marker
gene which is not expressed in cells in which the DNA
construct is properly inserted. This negative selectable
marker gene is disposed outside of the two targeting
regions so as to be removed when the construct is properly
combined into the gene by homologous recombination. An
example of such a negative selectable marker gene is the
Herpes Simplex Virus thymidine kinase gene.
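
The selection logic implied by this marker placement can be sketched
in a few lines of Python. This is only an illustrative model, not part
of the disclosed method: the class and function names are hypothetical,
and it assumes that a random (non-homologous) integration carries the
whole construct, negative marker included, into the genome, while a
homologous event inserts only what lies between the targeting regions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Integration:
    """Marker genes a cell carries after transfection (hypothetical model)."""
    has_positive_marker: bool  # e.g. neomycin resistance, between the targeting regions
    has_negative_marker: bool  # e.g. HSV thymidine kinase, outside the targeting regions


def integrate(mode: str) -> Integration:
    """Return the markers retained for a given integration mode.

    Assumption: 'random' integration keeps the entire construct, whereas
    'homologous' integration loses everything outside the targeting regions.
    """
    if mode == "homologous":
        return Integration(has_positive_marker=True, has_negative_marker=False)
    if mode == "random":
        return Integration(has_positive_marker=True, has_negative_marker=True)
    if mode == "none":
        return Integration(has_positive_marker=False, has_negative_marker=False)
    raise ValueError(f"unknown integration mode: {mode!r}")


def survives_selection(cell: Integration) -> bool:
    """A cell survives combined positive and negative selection only if it
    carries the positive marker and lacks the negative marker."""
    return cell.has_positive_marker and not cell.has_negative_marker


if __name__ == "__main__":
    for mode in ("homologous", "random", "none"):
        print(mode, survives_selection(integrate(mode)))
```

Under these assumptions only the homologous recombinants pass both
selections, which is the reason for placing the negative marker outside
the targeting regions.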

In yet a further embodiment, it is possible to
modify the expression characteristics of a specific gene
which already expresses a product in the cell line or
microorganism of interest. This can be accomplished by
inserting by homologous recombination a DNA construct which
includes (1) an expressible amplifiable gene which
increases the copy number of the gene of interest when the
cell line or microorganism is subjected to amplification
conditions and/or (2) a promoter/enhancer element (or other
regulatory element) which modifies the expression of the
gene of interest such as, for example, by increasing the
rate of transcription, increasing translation efficiency,
increasing mRNA accumulation, making the expression
inducible, etc. The gene expression which is modified in
this manner may be natural expression or expression which
has been caused by previous genetic manipulation of the
cell line or microorganism. The previous genetic
manipulation may have been by conventional techniques or by
means of homologous recombination in accordance with the
present invention. In the latter case, the DNA insertion
which results in the modification of expression
characteristics may be accomplished as part of the same
genetic manipulation which results in expression of the
gene or may be performed as a subsequent step.

The present invention also includes the
constructs prepared in accordance with the above discussion
as well as the genomes which have been properly subjected
to homologous recombination by means of such constructs and
the cell lines and microorganisms including these genomes.
Moreover, a process for preparation of the desired product
by culturing the transformed cells according to the present
invention is also included.

BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1 shows a general outline of a DNA construct 
in accordance with the present invention. 

Fig. 2A shows the mode of integration of the DNA
construct into the genome in the event of non-homologous or
random recombination.

Fig. 2B shows the mode of integration of the DNA
construct in the genome in the event of homologous
recombination.

Fig. 3 shows the construction of a preferred 
homologous recombination vector in accordance with the 
present invention. 

Fig. 4 shows the mode of integration of a
circular piece of DNA by homologous recombination when only
a single targeting piece of DNA is employed.

Fig. 5 shows the pRSVCAT plasmid, including the 
restriction sites thereof. 

Fig. 6 shows the construction of the pRSV
plasmid, including the restriction sites thereof.

Fig. 7 shows the pSV2NEO plasmid, including the
restriction sites thereof.

Fig. 8 shows the construction of the pSVNEOBAM 
plasmid, including the restriction sites thereof. 

Fig. 9 shows the construction of the pRSVNEO 
plasmid, including the restriction sites thereof. 

Fig. 10 shows the construction of the pRSVCATNEO
plasmid, including the restriction sites thereof.

Fig. 11 shows a 15.3 kb fragment of the rat TSHβ
gene, showing various restriction segments thereof.

Fig. 12 shows the construction of the
pRSVCATNEOTSHB3 plasmid, including the restriction sites
thereof.

Fig. 13 shows the construction of the
pRSVCATNEOTSHB3-5XbaI plasmid, including the restriction
sites thereof.

Fig. 14 shows a portion of the nucleotide
sequence of TSHβ along with the regions thereof to which
each primer for PCR amplification corresponds. Exons 2 and
3 are shown in capital letters. A 247 bp amplified
fragment is shown by underlined asterisks.

Fig. 15 shows the results of polyacrylamide gel
electrophoresis of cDNA synthesized from RNA extracted from
various cell populations and whose TSHβ cDNA, if present,
has been amplified by PCR. The nature of the cells
representing the various lanes is set forth in Fig. 15
below the gel.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Homologous recombination is a technique developed
within the past few years for targeting genes to induce or
correct mutations in transcriptionally active genes
(Kucherlapati, Prog. in Nucl. Acid Res. and Mol. Biol.,
36:301 (1989)). This technique of homologous recombination
was developed as a method for introduction of specific
mutations into specific regions of the mammalian genome
(Thomas et al., Cell, 44:419-428, 1986; Thomas and
Capecchi, Cell, 51:503-512, 1987; Doetschman et al., Proc.
Natl. Acad. Sci., 85:8583-8587, 1988) or to correct
specific mutations within defective genes (Doetschman et
al., Nature, 330:576-578, 1987).

Through this technique, a piece of DNA that one
desires to be inserted into the genome can be directed to a
specific region of the gene of interest by attaching it to
"targeting DNA". "Targeting DNA" is DNA that is
complementary (homologous) to a region of the genomic DNA.
If two homologous pieces of single stranded DNA (i.e., the
targeting DNA and the genomic DNA) are in close proximity,
they will hybridize to form a double stranded helix.
Attached to the targeting DNA is the DNA sequence that one
desires to insert into the genome.
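
As a rough illustration of how a targeting segment finds its genomic
counterpart, the following Python sketch searches a genomic sequence
for an exact match to a targeting sequence on either strand. It is a
simplification made under stated assumptions: the sequences and the
function names are hypothetical, and hybridization in a cell does not
require the perfect, full-length identity that this string search
demands.

```python
from typing import List, Tuple

# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_homology(genome: str, targeting: str) -> List[Tuple[int, str]]:
    """Locate exact matches of a targeting segment on either genomic strand.

    Returns (position, strand) pairs. The position is 0-based on the strand
    as written; strand is '+' when the targeting DNA matches that strand and
    '-' when it matches the complementary strand.
    """
    genome = genome.upper()
    hits: List[Tuple[int, str]] = []
    probes = (("+", targeting.upper()), ("-", reverse_complement(targeting)))
    for strand, probe in probes:
        start = genome.find(probe)
        while start != -1:
            hits.append((start, strand))
            start = genome.find(probe, start + 1)
    return hits


if __name__ == "__main__":
    genome = "TTGACCGTAGGCTAACGGATCCATG"  # made-up example sequence
    print(find_homology(genome, "GGCTAACG"))
    print(find_homology(genome, reverse_complement("GGCTAACG")))
```

The position returned marks where any sequence attached to the
targeting DNA would be brought into the genome in this simplified
picture.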

There are a number of methods by which homologous
recombination can occur. One example is during the process
of replication of DNA during mitosis in cells.

Through a mechanism that is not completely
understood, parental double-stranded DNA is opened
immediately prior to cell division at a local region called
the replication bubble. The two separated strands of DNA
may now serve as templates from which new strands of DNA
are synthesized. One arm of the replication fork has the
DNA code in the 5' to 3' direction, which is the
appropriate orientation from which the enzyme DNA
polymerase can "read". This enzyme attaches to the 5'
portion of the single stranded DNA and, using the strand as
a template, begins to synthesize the complementary DNA
strand. The other parental strand of DNA is encoded in the
3' to 5' direction; it cannot be read in this direction by
DNA polymerase. For this strand of DNA to replicate, a
special mechanism must occur.

A specialized enzyme, RNA primase, attaches
itself to the 3' to 5' strand of DNA and synthesizes a
short RNA primer at intervals along the strand. Using
these RNA segments as primers, the DNA polymerase now
attaches to the primed DNA and synthesizes a complementary
piece of DNA in the 5' to 3' direction. These pieces of
newly synthesized DNA are called Okazaki fragments. The
RNA primers that were responsible for starting the entire
reaction are removed by the exonuclease function of the DNA
polymerase and replaced with DNA. This phenomenon
continues until the polymerase reaches an unprimed stretch
of DNA, where the local synthetic process stops. Thus,
although the complementary parental strand is synthesized
overall in the 3' to 5' direction, it is actually produced
by "backstitching" in the 5' to 3' direction. Any nicks
that might occur in the DNA during the "backstitching"
process are sealed by an enzyme called DNA ligase.

To maintain an absolute fidelity of the DNA code,
a proofreading function is present within the DNA
polymerase. The DNA polymerase requires primed pieces of
DNA upon which to synthesize a new strand of DNA. As
mentioned above, this can be a single strand of DNA primed
with RNA, or a complementary strand of DNA. When the DNA
polymerase finds mismatched complementary pieces of DNA, it
can act as an exonuclease and remove DNA bases in a 3' to
5' direction until it reaches perfect matching again.

With this background, it is now possible to
understand the basis of the technique described herein.
Small pieces of targeting DNA that are complementary to a
specific region of the genome are put in contact with the
parental strand during the DNA replication process. It is
a general property of DNA that has been inserted into a
cell to hybridize and therefore recombine with other pieces
of DNA through shared homologous regions. If this
complementary strand is attached to an oligonucleotide that
contains a mutation or a different sequence of DNA, it too
is incorporated into the newly synthesized strand as a
result of the recombination. As a result of the
proofreading function, it is possible for the new sequence of
DNA to serve as the template. Thus, the transfected DNA is
incorporated into the genome.

If the sequence of a particular gene is known, a
piece of DNA that is complementary to a selected region of
the gene can be synthesized or otherwise obtained, such as
by appropriate restriction of the native DNA at specific
recognition sites bounding the region of interest. This
piece will act as a targeting device upon insertion into
the cell and will hybridize to its homologous region within
the genome. If this hybridization occurs during DNA
replication, this piece of DNA, and any additional sequence
attached thereto, will act as an Okazaki fragment and will
be backstitched into the newly synthesized daughter strand
of DNA.

In the technique of the present invention,
attached to these pieces of targeting DNA are regions of
DNA that are known to interact with the nuclear regulatory
proteins present within the cell and, optionally,
amplifiable and selectable DNA markers. Thus, the
expression of specific proteins may be achieved not by
transfection of DNA that encodes the gene itself and marker
DNA, as is most common, but rather by the use of targeting
DNA (regions of homology with the endogenous gene of
interest) coupled with DNA regulatory segments that provide
the gene with recognizable signals for transcription. With
this technology, it is possible to express and to amplify
any cognate gene present within a cell type without
actually transfecting that gene. In addition, the
expression of this gene is controlled by the entire genomic
DNA rather than portions of the gene or the cDNA, thus
improving the rate of transcription and efficiency of mRNA
processing. Furthermore, the expression characteristics of
any cognate gene present within a cell type can be modified
by appropriate insertion of DNA regulatory segments and
without inserting entire coding portions of the gene of
interest.

In accordance with these aspects of the instant
invention there are provided new methods for expressing
normally transcriptionally silent genes of interest, or for
modifying the expression of endogenously expressing genes
of interest, within a differentiated cell line. The
cognate genomic sequences that are desired to be expressed,
or to have their expression modified, will be provided with
the necessary cell specific DNA sequences (regulatory
and/or amplification segments) to direct or modify
expression of the gene within the cell. The resulting DNA
will comprise the DNA sequence coding for the desired
protein directly linked in an operative way to heterologous
(for the cognate DNA sequence) regulatory and/or
amplification segments. A positive selectable marker is
optionally included within the construction to facilitate
the screening of resultant cells. The use of the neomycin
resistance gene is preferred, although any selectable
marker may be employed. Negative selectable markers may,
optionally, also be employed. For instance, the Herpes
Simplex Virus thymidine kinase (HSVtk) gene may be used as
a marker to select against randomly integrated vector DNA.
The fused DNAs, or existing expressing DNAs, can be
amplified if the targeting DNA is linked to an amplifiable
marker.

Therefore, in accordance with the method of the
present invention, any gene which is normally expressed
when present in its specific eukaryotic cell line,
particularly a differentiated cell line, can be forced to
expression in a cell line not specific for it wherein the
gene is in a silent format. This occurs without actually
inserting the full DNA sequence for that gene. In
addition, that gene, or a normally expressing gene, can be
amplified for enhanced expression rates. Furthermore, the
expression characteristics of genes not totally
transcriptionally silent can be modified, as can the
expression characteristics of genes in microorganisms.

In one embodiment of the present invention,
eukaryotic cells that contain but do not normally
transcribe a specific gene of interest are induced to do so
by the technique described herein. The homologous
recombination vector described below is inserted into a
clonal cell line and, following chemical selection, is
monitored for production of a specific gene product by any
appropriate means, such as, for example, by detection of
mRNA transcribed from the newly activated gene,
immunological detection of the specific gene product, or
functional assay for the specific gene product.
The general outline of the DNA construct that is
used to transcriptionally activate endogenous genes by
homologous recombination is depicted in Figure 1.

In general, the DNA construct comprises at least
two and up to six or more separate DNA segments. The
segments consist of at least one, preferably two, DNA
targeting segments (A and B) homologous to a region of the
cell genome within or proximal to the gene desired to be
expressed, a positive selection gene (C), an amplifiable
gene (D), a negative selection gene (E) and a DNA
regulatory segment (F) which is transcriptionally active in
the cell to be transfected. In the most basic embodiment
of the present invention, only a single targeting segment
(B) and the regulatory segment (F) must be present. All of
the other regions are optional and produce preferred
constructs.
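
The composition rules just stated can be captured in a small Python
data structure. This is a sketch for orientation only: the class name,
field names and example values are hypothetical placeholders, and the
order in which inserted segments are listed is an assumption, since the
text does not fix their order within the construct.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Construct:
    """Segments of a targeting construct, labelled A to F as in the text.

    Each value is a free-text name; None means the segment is absent.
    """
    targeting_b: Optional[str]                    # B: targeting segment
    regulatory_f: Optional[str]                   # F: DNA regulatory segment
    targeting_a: Optional[str] = None             # A: second targeting segment
    positive_selection_c: Optional[str] = None    # C: positive selection gene
    amplifiable_d: Optional[str] = None           # D: amplifiable gene
    negative_selection_e: Optional[str] = None    # E: negative selection gene

    def is_minimal_valid(self) -> bool:
        """The most basic embodiment needs only B and F."""
        return self.targeting_b is not None and self.regulatory_f is not None

    def segment_count(self) -> int:
        """Number of segments actually present (two up to six here)."""
        values = (self.targeting_a, self.targeting_b, self.positive_selection_c,
                  self.amplifiable_d, self.negative_selection_e, self.regulatory_f)
        return sum(value is not None for value in values)

    def inserted_labels(self) -> List[str]:
        """Labels of the segments intended to end up in the genome (C, D, F).

        The listing order is an assumption made for this sketch.
        """
        pairs = (("C", self.positive_selection_c),
                 ("D", self.amplifiable_d),
                 ("F", self.regulatory_f))
        return [label for label, value in pairs if value is not None]


if __name__ == "__main__":
    basic = Construct(targeting_b="homology region", regulatory_f="promoter")
    print(basic.is_minimal_valid(), basic.segment_count(), basic.inserted_labels())
```

A construct with all six fields filled corresponds to the fully
featured form described above, while one carrying only B and F is the
most basic embodiment.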

Regions A and B are DNA sequences which are
homologous to regions of the endogenous gene of interest
which is to be made transcriptionally active. The specific
regions A and B of the endogenous gene are selected so as
to be upstream and downstream, respectively, of the
specific position at which it is desired for the regulatory
segment to be inserted. Although these regions are
separated in the construct they are preferably contiguous
in the endogenous gene. There may be occasions where
non-contiguous portions of the genome are utilized as targeting
segments, for example, where it is desired to delete a
portion of the genome, such as a negative regulatory
element.

While two targeting regions, A and B, are
preferred in order to increase the total regions of
homology and thus increase recombination efficiency, the
process of the present invention also comprehends the use
of only a single targeting region. In its simplest form
(when only the regulatory segment F and the selectable
marker gene C and promoter C' are to be inserted), a
circular piece of DNA is employed which contains these
elements along with the targeting DNA (see Figure 4). In
this way, the homologous region (B) hybridizes with its
genomic counterpart. Segments C, C' and F are inserted
within the B portion of the cognate gene following the
crossover event.

When it is desired for the DNA regulatory
sequence to be inserted upstream of the gene of interest,
as, for example, when it is desired to activate and express
a normally transcriptionally silent gene, the region of
homology is preferably homologous to a non-coding portion
of the genome upstream of the coding portions of the gene
of interest. When two targeting regions are present, the
downstream region (A) may include a portion of the coding
region, although it is preferred that it, too, be totally
upstream of the coding region. It is further preferred
that the homologous regions be chosen such that the DNA
regulatory sequence will be inserted downstream of the
native promoter for the gene of interest, particularly if
the native promoter is a negative promoter rather than a
turned-off positive promoter.

The size of the targeting regions, i.e., the
regions of homology, is not critical, although the shorter
the regions, the less likely that they will find the
appropriate regions of homology and recombine at the
desired spot. Thus, the shorter the regions of homology,
the less efficient is the homologous recombination, i.e.,
the smaller the percentage of successfully recombined
clones. It has been suggested that the minimum requirement
for sequence homology is 25 base pairs (Ayares et al, PNAS
USA, 83:5199-5203, 1986). Furthermore, if any of the other
elements of the construct are also found in the genome of
the host cell, there is a possibility of recombination at
the wrong place. However, in view of the excellent
positive and negative selectability of the present
invention, it can be successfully practiced even if the
efficiency is low. The optimum results are achieved when
the total region of homology, including both targeting
regions, is large, for example one to three kilobases. As
long as the regulatable segment F can be operatively linked
to the gene of interest, there is no limit to the size of
the targeting region, and particularly the upstream
targeting region B.

It can easily be empirically determined whether
or not the targeting regions are too large or the
regulatable segment F spaced too far from the coding region
of the gene to be operatively linked thereto. In such a
case, the regions A and B can be made homologous to a
different section of the gene of interest and the process
repeated until the regulatable segment F is properly
inserted so as to be operatively linked to the gene of
interest. For example, the restriction site of combined
region A-B of the endogenous gene can be changed and the
process repeated. Once the concept of the present
invention is known, along with the techniques disclosed
herein, one of ordinary skill in this art would be able to
make and use the present invention with respect to any
given gene of interest in any cell line or microorganism
without use of undue experimentation.

Region C is a positive selectable marker gene
which is capable of rendering the transfected cell line
resistant to a normally toxic environment. Examples of
such genes are adenosine deaminase (ADA), aminoglycoside
phosphotransferase (neo), dihydrofolate reductase (DHFR),
hygromycin-B-phosphotransferase (HPH), thymidine kinase
(tk), xanthine-guanine phosphoribosyltransferase (gpt),
multiple drug resistance gene (MDR), ornithine
decarboxylase (ODC) and N-(phosphonacetyl)-L-aspartate
resistance (CAD).

In addition to the positive selectable marker
gene, an amplifiable gene is also optionally included in
the construct at region D. Amplifiable genes are genes
that lead to an increase in copy number when under
selective pressure. The copy number of a gene positioned
adjacent to the amplifiable gene will also increase.
Amplifiable genes that can be utilized include DHFR, MDR,
ODC, ADA and CAD. The members of the positive selectable
marker gene group and those of the amplifiable gene group
overlap so that, in theory, instead of using two genes, one
for positive selection and one for amplification, one gene
could be used for both purposes. However, since most cell
lines contain endogenous copies of these amplifiable genes,
the cells will already be somewhat resistant to the
selection conditions and distinguishing the cells which
have transfected DNA from those which do not receive
transfected DNA can be difficult. Thus, in instances where
an amplifiable gene is desired, a positive selection gene
which is dominant, such as HPH, gpt, neo and tk (in tk-
cells), should also be included in the construct. For some
applications it may be possible or preferable to omit the
amplifiable marker. For instance, the gene of interest may
not need to be amplified as, for example, when
transcriptional activation by the heterologous DNA
regulatory sequence is sufficient without amplification.
Also, if the homologous recombination efficiency is very
low, it may be necessary to leave out the amplifiable gene
since the ratio of non-homologous DNA to homologous DNA is
directly related to the homologous recombination efficiency
(Letsou, Genetics, 117:759-770, 1987). It is also possible
to eliminate the positive selection gene and select cells
solely by screening for the production of the desired
protein or mRNA. However, it is preferred in most cases to
include at least the positive selection gene.

Region E of the construct is a negative
selectable marker gene. Such a gene is not expressed in
cells in which the DNA construct is properly inserted by
homologous recombination, but is expressed in cells in
which the DNA construct is inserted improperly, such as by
random integration. One such gene is the Herpes Simplex
Virus thymidine kinase gene (HSVtk). The HSVtk has a lower
stringency for nucleotides and is able to phosphorylate
nucleotide analogs that normal mammalian cells are unable
to phosphorylate. If the HSVtk is present in the cells,
nucleotide analogs such as acyclovir and gancyclovir are
phosphorylated and incorporated into the DNA of the host
cell, thus killing the cell. The presence of the negative
selectable marker gene enables one to use the
positive-negative selection for homologous recombination as
described by Mansour et al (Nature, 336:348-352, 1988).
Capecchi uses a strategy which takes advantage of the
differing modes of integration that occur when linearized
vector DNA inserts via homologous recombination as compared
to when it inserts by random integration. If the vector
DNA inserts randomly, the majority of the inserts will
insert via the ends (Folger et al, Mol. Cell. Biol.,
2:1372-1387, 1982; Roth et al, Mol. Cell. Biol.,
5:2599-2607, 1985; and Thomas et al, Cell, 44:419-428,
1986). On the other hand, if the vector inserts by homologous
recombination, it will recombine through the regions of
homology which cause the loss of sequences outside of those
regions.

Using the construct depicted in Figure 1 as an
example, the mode of integration for homologous
recombination versus random integration is illustrated in
Figures 2A and 2B. In the case of non-homologous
recombination (Figure 2A), the vector is inserted via the
ends of the construct. This allows region E, in this case
the HSVtk gene, to be inserted into the genome. However,
when homologous recombination occurs (Figure 2B), the HSVtk
gene is lost. The first round of selection uses the
appropriate drug or conditions for the positive selection
present within the construct. Cells which have DNA
integrated either by homologous recombination or random
integration will survive this round of selection. The
surviving cells are then exposed to a drug such as
gancyclovir which will kill all the cells that contain the
HSVtk gene. In this case, most of the cells in which the
vector integrated via a random insertion contain the HSVtk
gene and are killed by the drug while those in which the
vector integrated by homologous recombination have lost the
HSVtk gene and survive. This allows the elimination of
most of the cells which contain randomly integrated DNA,
leaving the majority of the surviving cells containing DNA
which integrated via homologous recombination. This
greatly facilitates identification of the correct
recombination event.
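
The two rounds of selection described above can be expressed as two successive filters. The following Python sketch is an idealized illustration, not a model of real cell populations: the function names and the dictionary keys `neo` and `hsvtk` are hypothetical, and it assumes that every random integrant retains the HSVtk gene, whereas the text states only that most do.

```python
def survives_positive(has_positive_marker: bool) -> bool:
    """Round 1 (e.g. G418): only cells carrying the positive marker survive."""
    return has_positive_marker


def survives_negative(has_hsvtk: bool) -> bool:
    """Round 2 (e.g. gancyclovir): cells carrying HSVtk are killed."""
    return not has_hsvtk


def select(cells):
    """Apply the positive round, then the negative round, in that order."""
    after_round1 = [c for c in cells if survives_positive(c["neo"])]
    return [c for c in after_round1 if survives_negative(c["hsvtk"])]


# Idealized marker content for the three outcomes discussed in the text.
cells = [
    {"name": "no integration", "neo": False, "hsvtk": False},
    {"name": "random integration", "neo": True, "hsvtk": True},
    {"name": "homologous recombination", "neo": True, "hsvtk": False},
]
survivors = [c["name"] for c in select(cells)]
print(survivors)
```

In practice some random integrants lose or inactivate the HSVtk gene and survive both rounds, which is why the surviving population is enriched for, rather than composed exclusively of, homologous recombinants.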

The negative selection step can also be
eliminated if necessary. It will require that the
screening step be more labor intensive, involving the need
for techniques such as polymerase chain reaction (PCR) or
immunological screening.

The sixth region (F) contains the DNA regulatory
segment that will be used to make the gene of interest
transcriptionally active. The appropriate DNA regulatory
segment is selected depending upon the cell type to be
used. The regulatory segment preferably used is one which
is known to promote expression of a given gene in the
differentiated host cell line. For example, if the host
cell line consists of pituitary cells which naturally
express proteins such as growth hormone and prolactin, the
promoter for either of these genes can be used as DNA
regulatory element F. When inserted in accordance with the
present invention, the regulatory segment will be
operatively linked to the normally transcriptionally silent
gene of interest and will stimulate the transcription
and/or expression of that gene in the host cell line. Also
usable are promiscuous DNA regulatory segments that work
across cell types, such as the Rous sarcoma virus (RSV)
promoter. As long as the regulatory segment stimulates
transcription and/or expression, or can be induced to
stimulate transcription and/or expression, of the gene of
interest after being inserted into the host cell line so as
to be operatively linked to the gene of interest by means
of the present invention, it can be used in the present
invention. It is important when joining the regulatory
segment F to the targeting segment A that no starting codon
be accidentally introduced into the sequence since such an
occurrence could alter the reading frame of the gene which
is desired to be expressed. Of course, the construct must
be constructed and inserted such that the regulatory
segment F is operatively linked to the gene of interest.

The DNA regulatory segment, region F, need not be
present in instances where it is desired to enhance or
amplify the transcription of a gene which is already
expressing in the cell line of interest, either because it
naturally expresses in that cell line or because the cell
line has previously had its DNA manipulated to cause such
expression. In such instances, insertion of an amplifiable
gene, region D, preferably with the positive selectable
marker gene, region C, and optionally also with a negative
selectable marker gene, region E, will be sufficient to
increase the copy number of the gene of interest and thus
enhance the overall amount of transcription.
Alternatively, a new regulatory segment, region F,
inherently promoting an increased (or otherwise modified)
rate of transcription as compared to the existing
regulatory region for the gene of interest, may be included
to further enhance the transcription of the existing
expressing gene of interest. Such a new regulatory segment
could include promoters or enhancers which improve
transcription efficiency.

Regions C', D' and E' are promoter regions which
are used to drive the genes in regions C, D, and E,
respectively. These promoters are transcriptionally active
in the cell line chosen and may be the same or different
from the promoter in region F used to drive the endogenous
gene of interest. The specific direction of transcription
specified in Fig. 1 is not critical. Those of ordinary
skill in this art can determine any appropriate placement
of the genes C, D and E and their promoters C', D' and E'
such that the promoters will stimulate expression of their
associated genes without simultaneously disrupting in any
way the expression of the gene of interest or any of the
other genes of the construct.

The present invention may be illustrated by
reference to the activation of the rat thyrotropin beta
subunit (TSHB) in GH1 (ATCC CCL 82), GH3 (ATCC CCL 82.1) or
GH4C1 cell lines (GH). GH cell lines are derived from a
radiation induced pituitary tumor in rats designated MtT/W5
(Takemoto, Cancer Res., 22:917, 1962) and adapted to grow
in culture by Tashjian et al, Endocrinology, 82:342-352,
1968. These cell lines may be subcloned and screened for
their ability to produce growth hormone and TSHB. Such
screening may preferably be by means of Northern blot
analysis to determine whether mRNA for the rat growth
hormone gene is present and to establish that there is no
mRNA for the TSHB gene being produced. The cell lines may
also be screened by Southern analysis to determine that
there is at least one copy of the TSHB gene present within
the genome. Only the GH cell lines that produce growth
hormone and not TSHB, but contain a copy of the TSHB gene,
are used.
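
The three screening criteria in the preceding paragraph combine into a single yes/no test per subclone. A minimal sketch in Python follows; the function name and parameter names are hypothetical, and the inputs stand for the outcomes of the Northern and Southern analyses rather than for any particular data format.

```python
def is_suitable_gh_subclone(gh_mrna_detected: bool,
                            tshb_mrna_detected: bool,
                            tshb_gene_copies: int) -> bool:
    """Apply the stated criteria for choosing a GH subclone.

    gh_mrna_detected   -- Northern blot shows growth hormone mRNA
    tshb_mrna_detected -- Northern blot shows TSHB mRNA
    tshb_gene_copies   -- TSHB gene copies found by Southern analysis
    """
    return gh_mrna_detected and not tshb_mrna_detected and tshb_gene_copies >= 1


# A subclone that makes growth hormone, makes no TSHB mRNA, and has one gene copy.
print(is_suitable_gh_subclone(True, False, 1))
```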

The specific homologous recombination vector for
use in GH cells may be designed in the following manner
(Figure 3). Region A may consist of the 5' upstream
untranslated region of the TSHB gene defined by the HindIII
fragment which stretches from -74 to -2785 and region B may
contain the DNA fragment that stretches from the -2785
HindIII site to a NcoI site approximately 2.1 kb further
upstream as described by Carr et al (J. Biol. Chem.,
262:981-987, 1987) and Croyle et al (DNA, 5:299-304, 1986).
The positive selection gene (region C) may be a 1067 bp
BglII-SmaI fragment derived from the plasmid pSV2neo (ATCC
No. 37,149) (Southern et al, J. Mol. Appl. Gen., 1:327-341,
1982). The neo gene may be driven by the Rous
Sarcoma Virus (RSV) promoter (region C') which is derived
from the NdeI-HindIII fragment from the plasmid pRSVcat
(ATCC No. 37,152) (Gorman et al, PNAS, 79:6777-6781,
1982). In this example, no amplifiable marker need be
used and thus there need be no region D in order to
optimize the efficiency of the homologous recombination.
The efficiency is inversely related to the proportion of
non-homologous to homologous sequences present in the
construct (Letsou et al, Genetics, 117:759-770, 1987).
Region E, or the negative selection gene, may consist of
the HSVtk gene which is a 2 kb Xho fragment obtained from
the plasmid pMC1TK (Capecchi et al, Nature,
336:348-352, 1988). The HSVtk gene in that construct may
be driven by the polyoma virus promoter and enhancer
(region E') as constructed by Thomas et al (Cell,
51:503-512, 1987). In a second DNA construct the polyoma promoter
may be replaced by the RSV promoter described above. The
DNA regulatory sequence used to activate the TSHB gene may
be either the RSV promoter or the rat growth hormone
promoter. The rat growth hormone promoter consists of the
SacI-EcoRI fragment obtained from the plasmid pRGH237CAT
(Larson et al, PNAS, 83:8283-8287, 1986). The RSV promoter
has the advantage of being usable in other cell lines
besides GH cells, while the GH promoter is known to be
active in GH cells and can be specifically induced (Brent
et al, J. Biol. Chem., 264:178-182, 1989). The rat growth
hormone promoter and the RSV promoter may be inserted at
location F in separate constructs.

Following transfection of the above construct
into a GH cell line, the cells may be grown in media that
contains G418. This will allow only those cells which have
integrated plasmid DNA into the genome either by homologous
recombination or random integration to grow. The surviving
cells may be grown in media that contains gancyclovir. The
majority of the cells that survive this round of selection
will be those in which the vector plasmid DNA is integrated
via homologous recombination. These cells may be screened
to demonstrate that they are producing mRNA which
corresponds to the TSHB gene and that they are producing
the TSHB protein. The genomic DNA may also be sequenced
around the area of insertion of the heterologous promoter
to ensure that the proper recombination event occurred.

EXAMPLE - Activation of TSHB Gene in Rat Pituitary Cells 
Using the following protocol, thyrotropin beta
subunit (TSHB) gene transcription, which normally does not
occur in the rat GH3 pituitary cell line, was activated in
those cells by using the process of homologous
recombination to target an activating element upstream of
the TSHB coding region. The Rous Sarcoma Virus (RSV)
promoter is known to function efficiently in GH3 cells
(Christian Nelson et al, Nature, 322:557-562 (1986);
Zheng-Sheng Ye et al, The Journal of Biological Chemistry,
263:7821-7829 (1988)) and therefore was chosen as the
activating element. A plasmid vector was constructed which
contained the RSV activating element, portions of the 5'
flanking region of the TSHB gene locus, and a selectable
drug marker, aminoglycoside phosphotransferase gene (NEO),
for the isolation of transfected cell populations.
Ribonucleic acid (RNA) was extracted from pooled drug
resistant GH3 cell populations and converted to
complementary deoxyribonucleic acid (cDNA). The cDNA was
then screened by the technique of polymerase chain reaction
(PCR) for the presence of TSHB cDNA. The construction of
the homologous recombination vectors and the control
vectors is outlined below along with the experimental
procedures and results.

PLASMID CONSTRUCTION
Homologous Recombination (HR) Backbone Vector
(pRSVCATNEO)

The Rous Sarcoma Virus (RSV) promoter was derived
from the plasmid pRSVCAT (Cornelia M. Gorman et al,
Proceedings of the National Academy of Science, 79:6777-6781
(1982)) (figure 5) by isolating the 580 base pair (bp)
NdeI - HindIII fragment containing the functional promoter
unit. The ends of this fragment were blunted using DNA
polymerase I Klenow fragment and XbaI linkers ligated to
the blunt ends. After digestion with XbaI restriction
endonuclease and gel purification, the resulting fragment
was ligated into the XbaI site of pUC18. A bacterial
colony harboring a plasmid with the RSV insert in the
orientation shown in figure 6 was designated pRSV. The
aminoglycoside phosphotransferase gene (NEO) was cloned
from pSV2NEO (P.J. Southern et al, Journal of Molecular
and Applied Genetics, 1:327-341 (1982)) by isolating the
BglII and BamHI fragment (figure 7) and ligating that
fragment into the BamHI site of pRSV (figure 6). A plasmid
containing the NEO gene in the orientation shown in
figure 8 was picked and designated pRSVNEOBAM. pRSVNEOBAM
was digested with SmaI and the 4328 bp fragment containing
the RSV promoter region, the majority of the NEO gene and
pUC18 was isolated by gel electrophoresis. The SmaI ends
of this fragment were XhoI linkered, cleaved with XhoI
restriction enzyme and the plasmid recircularized by
ligation. The resulting plasmid is shown in figure 9 and
is called pRSVNEO. This last cloning step resulted in the
deletion of a 786 bp fragment from the 3' end of the NEO
fragment which is not necessary for its functional
expression. This construction yields a plasmid in which
the NEO gene is transcriptionally driven by the RSV
promoter.

Next the NdeI site located 5' of the RSV promoter
in pRSVCAT (figure 5) was converted to a SalI site. This
was accomplished by digesting pRSVCAT with NdeI, filling in
the ends using DNA polymerase I Klenow fragment and
ligating SalI linkers to the resulting blunt ends. The
linkers were digested to completion with SalI and the
plasmid recircularized by ligation. Into the newly
constructed SalI site was cloned the SalI - XhoI fragment
from pRSVNEO (figure 9) containing the RSV promoter and the
NEO gene. A plasmid with the RSV promoter and NEO fragment
oriented as shown in figure 10 was isolated and named
pRSVCATNEO. This plasmid when transfected into GH3 cells
was capable of conferring G418 resistance to those cells,
demonstrating the ability of the RSV promoter to drive
transcription of the NEO gene and the ability of that RNA
to be translated into a functional protein (data not
shown). Total RNA from the stable transfectants above was
analyzed by polymerase chain reaction (PCR) to determine
whether the CAT gene was being transcribed. PCR results
showed that the CAT gene was indeed being transcribed in
all the G418 resistant colonies tested (data not shown),
indicating that the RSV promoter 5' of the CAT gene was
capable of driving transcription of a gene located 3' to
it. This is important because this RSV promoter will be
responsible for driving transcription of the TSHB gene when
the TSHB HR vector described below integrates via
homologous recombination into the GH3 genome.

TSHB HR Vector

A vector capable of integrating into the GH3
genome by homologous recombination was created by inserting
two stretches of the 5' flanking regions of the thyrotropin
beta subunit (TSHB) gene into the unique SalI and HindIII
sites contained in pRSVCATNEO (figure 10). A rat spleen
genomic library containing inserts of 15 kilobases (kb) or
greater cloned into lambda DASH was obtained from
Stratagene, San Diego, CA. Using standard protocols
(Current Protocols in Molecular Biology, pp. 1.9.1 - 1.13.6,
6.1.1 - 6.4.10) a 15.3 kb clone of the rat genomic TSHB
gene including 9 kb of sequence 5' of the first exon was
isolated. The 15.3 kb fragment consisted of two XbaI
fragments, a 10.6 kb fragment corresponding to the 5' end
of the 15.3 kb fragment and a 4.7 kb piece corresponding to
the 3' region of the 15.3 kb fragment (figure 11). Both of
these XbaI fragments were subcloned into pUC18 and plasmids
containing inserts in both orientations were isolated. The
2.3 kb XbaI - HindIII fragment contained in the 4.7 kb XbaI
fragment (figure 11) was purified and the XbaI site of this
fragment was converted to a HindIII site by filling in the
ends with Klenow fragment and ligating on HindIII linkers.
This fragment was ligated into the unique HindIII site
contained in pRSVCATNEO (figure 10). An isolate
corresponding to a plasmid with the 2.3 kb insert in the
correct orientation as shown in figure 12 was assigned the
name pRSVCATNEOTSHB3.

The subcloned 10.6 kb XbaI fragment from the rat
TSHB clone (figure 11) was isolated and the XbaI ends
converted to SalI sites by blunt ending the fragment with
DNA polymerase I Klenow fragment and attaching SalI
linkers. This 10.6 kb SalI fragment was then cloned into
the SalI site of pRSVCATNEOTSHB3 (figure 12). A plasmid
containing the insert in the correct orientation was
identified and named pRSVCATNEOTSHB3-5XbaI (figure 13).
The latter plasmid has been deposited in the American Type
Culture Collection, Rockville, MD, and has received
depository number ATCC 40933. For the purpose of this
deposit, the plasmid was renamed pHRTSH. This deposit was
made in accordance with all of the requirements of the
Budapest Treaty.



CELL LINE

GH3 cells are a subcloned population of MtT/W5
which was derived from a radiation induced pituitary tumor
in rats (B.K. Takemoto, Cancer Research, 22:917 (1962))
and adapted to growth in culture by Tashjian et al,
Endocrinology, 82:342-352 (1968). The GH3 cells were
obtained from the American Type Culture Collection cell
bank and maintained in culture by growth in Dulbecco's
Modified Eagle's Medium (DMEM) + 15% horse serum (HS) +
[...] + L-glutamine (GH3 media)
at 37°C in 5% CO2.

DNA PREPARATION

Large-Scale Preparation of Plasmid DNA 

All plasmids used for stable transfections were
purified by using the alkaline lysis method for large-scale
plasmid DNA purification as described in Current Protocols
in Molecular Biology, vol. 1, pp. 1.7.1 - 1.7.2. DNA
isolated by the alkaline lysis method was further purified
by double banding in a cesium chloride gradient as also
described in Current Protocols in Molecular Biology, vol.
1, pp. 1.7.5 - 1.7.7.

Prior to transfection, the HR vectors were
digested with either AatII or ApaI. ApaI was used to
linearize the control plasmid pRSVCATNEO and AatII to
linearize the HR plasmid pRSVCATNEOTSHB3-5XbaI. The
location of the cleavage sites of ApaI and AatII can be
seen in figures 10 and 13 respectively. After digestion
with the appropriate restriction enzyme, the reaction was
phenol/chloroform extracted, chloroform extracted, ethanol
precipitated, and washed once with 70% ethanol. The
plasmids were then resuspended in sterile deionized water
(dH2O) to a concentration of 1 microgram/microliter (µg/µl)
as determined by absorbance at OD260. In an attempt to
increase the transfection efficiency and/or the ratio of
homologous recombination positives to those that were due
to random integration, pRSVCATNEOTSHB3-5XbaI was digested
with ApaI. Digestion with ApaI cuts at three separate
sites in pRSVCATNEOTSHB3-5XbaI and removes all regions of
the vector except those necessary for homologous
recombination (figure 13). After digestion with ApaI, the
reaction was electrophoresed on a 0.8% agarose gel and the
top band corresponding to the 10,992 bp fragment containing
the two 5' flanking regions of the TSHB gene, the RSV
promoter - NEO region and the TSHB gene-activating RSV
promoter was isolated from the gel by electroelution into
dialysis tubing. The electroeluted DNA was further
purified by using an Elutip minicolumn (Schleicher and
Schuell) with the manufacturer's recommended standard
protocol. The DNA was eluted from the column, ethanol
precipitated, washed with 70% ethanol and resuspended to a
concentration of 1 µg/µl.
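
The text states that DNA concentration was determined by absorbance at 260 nm but does not give the conversion used. The following Python sketch shows one common way such a calculation is done, assuming the widely used approximation that an A260 of 1.0 corresponds to about 50 µg/ml of double-stranded DNA at a 1 cm path length; that factor, the function names and the example readings are assumptions and do not come from the text.

```python
# Assumed conversion factor (not stated in the text): A260 of 1.0 is taken
# to correspond to roughly 50 ug/ml double-stranded DNA, 1 cm path length.
DSDNA_UG_PER_ML_PER_A260 = 50.0


def dsdna_ug_per_ul(a260: float, dilution_factor: float) -> float:
    """Concentration of the undiluted stock in ug/ul from a diluted reading."""
    ug_per_ml = a260 * DSDNA_UG_PER_ML_PER_A260 * dilution_factor
    return ug_per_ml / 1000.0  # 1 ml = 1000 ul


def resuspension_volume_ul(total_ug: float, target_ug_per_ul: float = 1.0) -> float:
    """Volume of water needed to bring total_ug of DNA to the target concentration."""
    return total_ug / target_ug_per_ul


# Hypothetical reading: A260 = 0.4 measured on a 1:100 dilution of the stock.
stock = dsdna_ug_per_ul(a260=0.4, dilution_factor=100.0)
print(f"stock concentration: {stock:.2f} ug/ul")
```

With the target left at its default of 1 µg/µl, `resuspension_volume_ul` simply returns the DNA mass in µg as a volume in µl, matching the working concentration named in the text.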

STABLE TRANSFECTIONS 
Calcium Phosphate Transfection

48 hours prior to transfection 3 x 10^6 GH3 cells
were plated on 10 centimeter (cm) dishes. For each dish,
10 µg of vector DNA along with 30 µg of sonicated salmon
sperm DNA was added to 0.5 milliliters (ml) of transfection
buffer. The transfection buffer was prepared by combining
4 g NaCl, 0.185 g KCl, 0.05 g Na2HPO4, 0.5 g dextrose, 2.5 g
HEPES and dH2O to a final volume of 500 ml and bringing the
pH to 7.5. 31 µl of 2 molar (M) CaCl2 was added to the 0.5
ml of DNA + transfection buffer and vortexed. This
solution was allowed to stand at room temperature for 45
minutes. When the DNA - CaCl2 - transfection buffer was
ready, the GH3 medium was removed from the GH3 cells and
the DNA - CaCl2 - transfection buffer was layered over the
cells. The cells were allowed to stand at room temperature
for 20 minutes. After 20 minutes, 5 ml of GH3 medium was
added and the plates were incubated at 37°C for 6 hours.
The cells were then shocked by aspirating off the medium
and adding 5 ml of fresh transfection buffer containing 15%
glycerol for 3.5 minutes. The cells were rinsed 2x with
PBS and fed with 10 ml of GH3 medium. 48 hours
post-transfection, the medium was removed and 10 ml of GH3
medium containing 400 µg/ml G418 was added.
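
The final calcium chloride concentration in the precipitation mixture is not stated, but it follows from the volumes given. The Python sketch below works it out under one simplifying assumption, namely that the volume contributed by the DNA itself is negligible, so that the mixture is 31 µl of 2 M stock in 500 µl of buffer. The function name is hypothetical.

```python
def final_molarity(stock_molar: float, stock_ul: float, diluent_ul: float) -> float:
    """Molarity after mixing stock_ul of stock into diluent_ul of diluent."""
    return stock_molar * stock_ul / (stock_ul + diluent_ul)


# 31 ul of 2 M CaCl2 added to 0.5 ml (500 ul) of DNA + transfection buffer.
# Assumption: the DNA volume is ignored.
cacl2_molar = final_molarity(stock_molar=2.0, stock_ul=31.0, diluent_ul=500.0)
print(f"final CaCl2: {cacl2_molar * 1000:.0f} mM")
```

Under that assumption the result is about 117 mM; including the DNA volume would lower it slightly.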

Electroporation

Electroporation was carried out using a BTX 300
Transfector with 3.5 millimeter (mm) gap electrodes. 1 x
10^7 cells growing in log phase were removed from their
plates by trypsinization, pelleted by centrifugation and
washed once with PBS. Cells were resuspended in 1.0 ml of
PBS and transferred to 2.9 ml Ultra-UV disposable cuvettes
(American Scientific Products) on ice. 10 µg of DNA was
added to the cells, mixed and placed back on ice for 5
minutes. After 5 minutes the electrodes were placed in the
chamber and the cells were electroporated at a setting of
750 microfarads with a 200 volt pulse. The cuvette was
then returned to ice for 10 minutes. Cells were
transferred from the cuvette to 9 ml of GH3 medium
containing 1% penicillin and 1% streptomycin at room
temperature in a 15 ml conical tube and allowed to stand
for 10 minutes. The total electroporation of 1 x 10^7 cells
was transferred to three 10 cm plates giving approximately
3 x 10^6 cells per plate. After 48 hours, the GH3 medium
containing 400 µg/ml G418 was added.

Transfection of GH3 cells with pRSVCATNEOTSHB3-5XbaI (AatII
cut), pRSVCATNEOTSHB3-5XbaI (ApaI cut) and pRSVCATNEO (ApaI
cut)

pRSVCATNEOTSHB3-5XbaI (AatII cut),
pRSVCATNEOTSHB3-5XbaI (ApaI cut) and pRSVCATNEO (ApaI cut)
plasmids were transfected into GH3 cells along with a no
DNA control using both the calcium phosphate protocol and
the electroporation protocol. 48 hours after transfection,
the cells were put under G418 selection. Approximately 14
to 21 days later the colonies became visible by eye on the
10 cm dishes and were counted. In all of the no DNA
controls, there were no visible colonies, demonstrating
that the G418 selection was working and that the presence
of a plasmid containing the RSV - NEO region was necessary
to confer G418 resistance. At this time, colonies were
picked and pooled by isolating regions on the 10 cm dish
with 17 millimeter wide cloning rings. These large cloning
rings encompassed between 10 and 70 colonies depending on
the density of the colonies per plate and allowed the GH3
cells in that isolated region to be removed and pooled at
the same time by trypsinization. The trypsinized colonies in
each ring were transferred to 6 well plates and allowed to
grow in GH3 media containing G418. After reaching 70% to
80% confluence, 80,000 cells were transferred to a 24 well
plate and the remaining cells cryopreserved for further
testing at a later date. The cells in the 24 well plates
were grown until they reached 50% to 80% confluence. Total
RNA was then harvested from these GH3 cells by the
following procedure.

RNA ISOLATION FROM TRANSFECTED GH3 CELLS GROWN IN 24 WELL
PLATES

The following is a modification of the protocol 
described by Chomczynski and Sacchi, Anal. Biochem., 
162:156-159 (1987). The media covering the GH3 cells in 
the 24 well plates was removed and the cells washed with 1 
ml of PBS. 1 ml of GTC solution was added and the cells 
were incubated at room temperature for 5 minutes. GTC 
solution was prepared by dissolving 250 g of guanidinium 
thiocyanate (Fluka) in 293 ml of dH2O, and then adding 
17.6 ml of 0.75 M Na citrate pH 7.0 and 26.4 ml of 10% 
sarcosyl (L-Lauryl sarcosine). Just prior to use, 360 µl 
of β-mercaptoethanol per 50 ml GTC solution was added. 
After 5 minutes at room temperature, the 1 ml of GTC-cell 
lysate was transferred to a Sarstedt 55.518 snap-cap tube 
containing 2 ml of GTC solution. To each tube was added 
300 µl of 2 M sodium acetate pH 4.0 and the tube vortexed. 
Next, 3 ml of dH2O saturated phenol was added and the tubes 
were vortexed again. To each tube was added 600 µl of 
chloroform:isoamyl alcohol (49:1) and the tube was shaken 
by hand for 10 seconds and placed on ice for 15 minutes. 
The tubes were then centrifuged in a Sorval RC-5B using a 
SM24 rotor, at 8000 revolutions per minute (RPM) for 20 
minutes at 4°C. The aqueous phase was transferred to a 
fresh Sarstedt tube containing 3 ml of isopropanol and 
placed at -20°C for 1 hour. After 1 hour the tubes were 
spun in a Sorval RC-5B using a SM24 rotor at 8000 rpm for 
20 minutes at 4°C. The supernatants were removed and the 
pellets resuspended in 500 µl of GTC solution. The 
resuspended RNA was transferred to a 1.5 ml eppendorf tube 
to which 500 µl of isopropanol was added. The tubes were 
once again placed at -20°C for 1 hour. The eppendorf tubes 
were spun for 5 minutes in a microfuge and the supernatant 
discarded. The pellet was washed with 70% ethanol 2 times 
and allowed to dry until the ethanol had completely 
evaporated. The pellet was resuspended in 20 µl of diethyl 
pyrocarbonate (depc) treated water and heated to 65°C for 5 
minutes. This RNA was then used to make cDNA in one of the 
two procedures described below. 
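
As a rough check on the GTC recipe given above, the following 
Python sketch works out the amount of each component and its 
approximate concentration. It is an illustration only. The 
final volume (final_volume_ml) is an assumption, because the 
text states only the liquid volumes added and dissolving 250 g 
of solid increases the volume. The molecular weight of 
guanidinium thiocyanate (118.16 g/mol) is a standard value that 
does not appear in the text, and 10% sarcosyl is taken to mean 
10% weight per volume. 

```python
# Approximate component concentrations for the GTC solution described above.
GTC_MW = 118.16  # g/mol, guanidinium thiocyanate (standard value, not from the text)


def gtc_concentrations(final_volume_ml):
    """Return approximate concentrations for a given final volume.

    final_volume_ml is an assumption: the text does not state it.
    """
    litres = final_volume_ml / 1000.0
    gtc_mol = 250.0 / GTC_MW        # 250 g of solid
    citrate_mmol = 17.6 * 0.75      # 17.6 ml of 0.75 M Na citrate
    sarcosyl_g = 26.4 * 0.10        # 26.4 ml of 10% (w/v, assumed) sarcosyl
    return {
        "gtc_M": gtc_mol / litres,
        "citrate_mM": citrate_mmol / litres,
        "sarcosyl_percent": 100.0 * sarcosyl_g / final_volume_ml,
    }


# Liquid added before the solid dissolves: water + citrate + sarcosyl
liquid_added_ml = 293 + 17.6 + 26.4
```

For a hypothetical final volume of 528 ml, 
gtc_concentrations(528.0) gives roughly 4 M guanidinium 
thiocyanate, 25 mM citrate, and 0.5% sarcosyl. 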

cDNA REACTIONS 
Method 1 

First strand cDNA was synthesized from 2.5-6.0 
microliters of total RNA (approximately 0.5-6 micrograms) 
in a reaction volume of 10-20 microliters. The total RNA 
was obtained by the extraction method described above, and 
was denatured for 5-10 minutes at 70°C and quick chilled on 
ice before adding the reaction components. The reaction 
conditions were 50 millimolar (mM) Tris-HCl (pH 8.3), 10 mM 
MgCl2, 10 mM DTT, 0.5 mM each of dCTP, dATP, dGTP, and dTTP 
(Pharmacia), 40 mM KCl, 500 units/ml RNasin (Promega 
Biotech), 85 µg/ml oligo(dT)12-18 (Collaborative Research, 
Inc.), and 15,000-20,000 units/ml Moloney murine leukemia 
virus reverse transcriptase (Bethesda Research 
Laboratories) incubated at 37°C for 60 minutes. The 
reaction was terminated by the addition of EDTA to 40 mM, 
and the nucleic acid was precipitated by adding sodium 
acetate to a concentration of 0.3 M and two volumes of 
ethanol. The precipitate was allowed to form at 0°C for 30 
minutes and was pelleted by centrifugation in a microfuge 
at 14,000 rpm for thirty minutes. The pellet was washed 
with 70% ethanol, dried, and resuspended in depc treated 
water to a volume of 15-25 microliters. 

Method 2 

Conditions for first strand synthesis of cDNA 
from RNA were adapted from Carol A. Brenner et al, 
BioTechniques, Vol. 7, No. 10, pp. 1096-1103 (1989). 1 µl 
of total RNA from the RNA prep procedure described above 
was added to 9 µl of reaction buffer in a 0.5 ml eppendorf 
tube. The reaction buffer consisted of 200 units of 
Moloney murine leukemia virus reverse transcriptase (MMLVRT, 
Bethesda Research Labs), and a final concentration of the 
following reagents: 70 mM Tris.HCl pH 8.8, 40 mM KCl, 0.1% 
Triton X-100, 1 mM of each dNTP, 4 mM MgCl2, and 0.45 OD260 
units of random hexamers (Pharmacia). After mixing, the 
tubes were incubated at room temperature for 10 minutes and 
then placed at 42°C for 1 hour. After 1 hour the tubes 
were heated to 90°C for 1 minute to deactivate the MMLVRT 
and then cooled to room temperature. 

POLYMERASE CHAIN REACTION (PCR) AMPLIFICATION OF RNA FROM 
GH3 CELLS 

The following primers were used to amplify, by 
PCR, TSHB cDNA synthesized from RNA transcripts produced by 
the GH3 cells as a result of the HR plasmids activating the 
endogenous TSHB gene by homologous recombination. 

Primer    5'                           3' 

TSHB5     AGTATATGATGTACGTGGACAGG 
TSHB3     CACTTGCCACACTTGCAGCTCAGG 

Figure 14 shows the regions of the TSHB gene to 
which each primer corresponds. 
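
The relationship between the two primers and the amplified 
fragment can be sketched in Python as follows. This is a 
minimal illustration, not part of the disclosed method: the 
function names are hypothetical, and the template is assumed 
to be supplied as the coding strand written 5' to 3'. The 
reverse primer anneals to the coding strand, so its site is 
located by searching for its reverse complement. 

```python
# Reverse complement and a toy in-silico PCR for the two primers listed above.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

TSHB5 = "AGTATATGATGTACGTGGACAGG"    # forward primer, 5' to 3'
TSHB3 = "CACTTGCCACACTTGCAGCTCAGG"   # reverse primer, 5' to 3'


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def amplicon_length(template, forward, reverse):
    """Length of the product on `template` (coding strand, 5' to 3').

    Returns None if either primer site is missing.
    """
    start = template.find(forward)
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse)
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return end + len(reverse_site) - start
```

Applied to a cDNA sequence containing both sites, 
amplicon_length returns the size of the expected product, 
measured from the first base of the forward primer to the last 
base of the reverse primer site. 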

PCR REACTION CONDITIONS 

All PCR reactions were performed in the Ericomp 
Twinblock thermocycler. If PCR amplification was to be run 
on cDNA made by method 2, 40 µl of additional reaction mix 
was directly added to the 10 µl of the cDNA reaction 
bringing the total volume up to 50 µl. The final 
concentrations of reagents in the 50 µl were 70 mM Tris.HCl 
pH 8.8, 40 mM KCl, 0.1% Triton X-100, 2.25 units Taq 
polymerase (Pharmacia), 0.2 micromolar (µM) each primer, 
200 µM each dNTP, and 0.8 mM MgCl2. 

If PCR was to be performed on cDNA made by method 
1 above, 5 to 10 µl of the resuspended cDNA was added to 40 
to 45 µl containing final concentrations of the following: 
70 mM Tris.HCl pH 8.8, 40 mM KCl, 0.1% Triton X-100, 2.25 
units Taq polymerase, 0.2 µM each primer, 200 µM each dNTP, 
and 0.8 mM MgCl2. 
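
The final dNTP and MgCl2 concentrations stated for cDNA made 
by method 2 follow from simple dilution of the 10 microliter 
cDNA reaction to 50 microliters. The short Python sketch below 
shows the arithmetic (C1 x V1 = C2 x V2). It assumes that the 
additional 40 microliters of reaction mix contributes no 
further dNTP or MgCl2, which the text does not state 
explicitly but which agrees with the concentrations given. 

```python
def diluted(conc, volume_in_ul, final_volume_ul):
    """Concentration after diluting volume_in_ul at `conc` to final_volume_ul."""
    return conc * volume_in_ul / final_volume_ul


# Method 2 cDNA reaction: 10 ul containing 1 mM each dNTP and 4 mM MgCl2,
# brought to 50 ul with additional reaction mix (assumed free of both).
dntp_uM = diluted(1000.0, 10, 50)   # 1 mM expressed as 1000 uM
mgcl2_mM = diluted(4.0, 10, 50)
```

The results, 200 micromolar each dNTP and 0.8 mM MgCl2, match 
the final concentrations listed above. 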

The reactions were then subjected to the 
following PCR cycles: 

1 minute at 94°C. 
30 seconds at 55°C. 
2 minutes at 72°C. 

The above cycle was repeated 30 to 40 times. 10 
µl of each reaction mix was run on a 6% polyacrylamide gel 
and screened for the presence of a 247 bp PCR fragment 
which would indicate the presence of the properly spliced 
mRNA for TSHB. 
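
For planning purposes, the cycling program above can be 
written as data and its total hold time estimated, as in the 
following Python sketch. The names are hypothetical, and ramp 
time between temperatures is ignored because it depends on the 
instrument and is not given in the text. 

```python
# (temperature in deg C, duration in seconds) for one cycle, as listed above
CYCLE = [(94, 60), (55, 30), (72, 120)]


def hold_time_minutes(n_cycles, cycle=CYCLE):
    """Total programmed hold time in minutes, ignoring ramp time between steps."""
    return n_cycles * sum(seconds for _, seconds in cycle) / 60.0
```

Each cycle holds for 210 seconds, so 30 to 40 cycles 
correspond to 105 to 140 minutes of hold time. 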

PCR RESULTS FOR AMPLIFICATION OF TSHB RNA FROM GH3 CELLS 
AND RAT PITUITARY GLAND TOTAL RNA 

To determine whether GH3 cells normally 
synthesize TSHB RNA, cDNA from untransfected GH3 cells as 
well as cDNA from rat pituitary glands was subjected to the 
above PCR reaction conditions. The correct 247 bp band 
indicative of the presence of TSHB mRNA was visible in the 
positive control of the rat pituitary gland sample but no 
band was visualized from the GH3 cell total RNA sample even 
after 60 cycles (data not shown). 

TRANSFECTION RESULTS 

The number of G418 resistant colonies present on 
the 10 cm dishes was tabulated between 14 and 21 days 
after addition of G418 to the media. 

Transfection Method      Colonies per 10 cm dish 

                         pRSVCATNEO    pRSVCATNEOTSHB3-5XbaI 
                                       ApaI cut       Aat2 cut 
Calcium phosphate 1      48            13             29 
Calcium phosphate 2      -             21             53 
Electroporation 1        -             1295           415 
Electroporation 2        -             [illegible]    723 

Total RNA was harvested from the colony pools 
contained in the 24 well plates as described above. cDNA 
was made from these RNA preps and subjected to PCR 
amplification. The number of positive colonies producing 
TSHB mRNA was determined by the presence of a 247 bp 
fragment as visualized on a polyacrylamide gel. Each of 
the pools screened contained between 10 and 70 colonies. 
The estimated number of colonies per pool per transfection 
was used to approximate the number of G418 resistant GH3 
cell clones in which TSHB gene transcription was activated. 
If a pool tested positive, it was assumed that this 
represented one positive colony present in that particular 
pool. 
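
The estimate described above, in which each positive pool is 
counted as a single positive colony, can be sketched in Python 
as follows. The function name and the example numbers are 
hypothetical and are not results from the experiments. Because 
a positive pool could contain more than one positive colony, 
the value is a lower bound on the true frequency. 

```python
def activation_frequency(positive_pools, total_resistant_colonies):
    """Estimated fraction of G418 resistant colonies with an activated gene.

    Counts one positive colony per positive pool, so this is a lower bound.
    """
    if total_resistant_colonies <= 0:
        raise ValueError("total_resistant_colonies must be positive")
    return positive_pools / total_resistant_colonies


# Hypothetical numbers for illustration only, not results from the text
example = activation_frequency(positive_pools=2, total_resistant_colonies=2000)
```

With these hypothetical inputs the estimate is one positive 
colony per thousand G418 resistant colonies. 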

Plasmid                     G418 Resistant    TSHB 
                            Colonies          Positives 

pRSVCATNEO                  [illegible]       0 

pRSVCATNEOTSHB3-5XbaI       4942              [illegible] 
(Aat2 digested) 

pRSVCATNEOTSHB3-5XbaI       [illegible]       [illegible] 
(ApaI digested) 

These results demonstrate the successful 
activation of the normally transcriptionally silent TSHB 
gene by the method of the present invention. While the 
number of colonies that are positive for TSHB transcription 
is small compared to the number of colonies that are G418 
resistant (approximately one out of every 10^3 G418 
resistant colonies), this result is generally consistent 
with rates reported for other homologous recombination 
experiments (Michael Kriegler, Gene Transfer and Expression: 
A Laboratory Manual, Stockton Press, New York, NY (1990), 
pp. 56-60). It has been generally observed that the 
homologous recombination rate seems to be proportional to 
the rate of transcription of the targeted gene (M. Frohman 
and G. Martin, Cell, 56:145 (1989); L. Mansour et al, 
Nature, 336:348 (1988)). It should be noted that the rate 
which has been demonstrated is three orders of magnitude 
higher than what might be expected for random mutation 
turning on the TSHB gene. 

To ensure that the results for each colony pool 
were reproducible and that the activation of RNA 
transcription was stable, colony pools previously frozen 
away corresponding to pools which tested positive in the 
first screening were thawed and expanded in culture. The 
freshly thawed GH3 positive pools were seeded in T 25 
tissue culture flasks and expanded until the cells reached 
70% to 80% confluence. 80,000 cells were then plated in 24 
well plates from each flask and grown until they were 50% 
to 70% confluent. RNA was extracted from the cells, 
converted into cDNA, and screened once again for the 
presence of TSHB RNA by running 10 µl of each PCR reaction 
on a 6% polyacrylamide gel. Figure 15 shows the results 
of representative PCR reactions from the second screening 
as visualized on a polyacrylamide gel by ethidium bromide 
staining and fluorescence. Lanes 1, 2, and 3 contain the 
PCR reactions run on cDNA from GH3 cells which had been 
transfected by pRSVCATNEO. pRSVCATNEO contains no regions 
of homology to TSHB and thus is not capable of activating 
the TSHB gene by homologous recombination. As can be seen 
on the gel in Figure 15, there are no bands corresponding 
to 247 bp in those lanes, indicating that the TSHB gene is 
not activated. Lane 6 also contains a negative control. 
In that lane three pools were combined from samples of GH3 
cells which had been transfected with pRSVCATNEOTSHB3-5XbaI 
(ApaI cut) but which were negative for transcription of the 
TSHB gene on the first screening. The absence of the 247 
bp fragment in lane 6 demonstrates that the presence of the 
transfected pRSVCATNEOTSHB3-5XbaI (ApaI cut) plasmid 
integrated randomly in the genome is not capable of 
producing the 247 bp TSHB PCR fragment. Lanes 7, 8, 9, and 
10 include PCR reactions run on cDNA made from total RNA 
harvested from rat pituitary glands in quantities per 
reaction of 25 nanograms, 100 nanograms, 200 nanograms, and 
400 nanograms, respectively. The presence in these lanes 
of the expected 247 bp band, produced from cDNA prepared 
from a rat tissue which normally expresses TSHB, showed 
that the PCR reaction conditions were correctly 
optimized and that the PCR band obtained in lanes 4 and 5 
containing the homologous recombination TSHB positives is 
of the correct size. Two pools transfected with 
pRSVCATNEOTSHB3-5XbaI (ApaI cut) which were positive in the 
first screening, ApaI-107 in lane 4 and ApaI-136 in lane 5, 
once again tested positive for TSHB gene activation as 
demonstrated by the presence of the correct TSHB PCR band 
amplified from cDNA made from the total RNA extracts from 
those pools, proving that transcription of the TSHB gene has 
been stably activated. The presence of bands at 247 bp in 
lanes 4 and 5 containing RNA from previous positives 
ApaI-107 and ApaI-136 and the absence of bands in the negative 
controls of pRSVCATNEO transfected GH3 cells in lanes 1-3 
and the pRSVCATNEOTSHB3-5XbaI (ApaI cut) negatives in lane 
6 demonstrated that the production of TSHB RNA in a cell 
line that does not normally produce that RNA has been 
stably turned on by homologous recombination. 

The present invention is not limited to the cell 
line that is described herein. All cell lines have genetic 
information which is normally silent or inert. Most are 
able to express only certain genes. However, a normally 
transcriptionally silent or inert gene of any such cell 
line can be activated to express the gene product in 
accordance with the present invention and any gene of the 
genome may have its expression characteristics modified in 
accordance with the present invention. Even previously 
transformed cell lines can be used as long as the previous 
transformation did not disrupt the gene of interest. The 
source of the cell line is not important. The cell line 
may be animal or plant, primary, continuous or immortal. 
Of course, it is desirable that any such cell line be 
stable and immortal so that after treatment with the 
technique in accordance with the present invention, 
expression can be commercialized. Cloned microorganisms, 
whether prokaryotic or eukaryotic, may also be treated by 
the technique of the present invention. 

While the present invention has been preferably 
described with respect to the expression of a normally 
transcriptionally silent or inert gene, the technique of 
the present invention is also applicable to the 
modification of the expression characteristics of a gene 
which is naturally expressed in the host cell line. For 
example, if it is desired to render the expression of a 
gene dependent upon culture conditions or the like so that 
expression can be turned on and off at will, an appropriate 
DNA regulatory segment, such as a regulatable promoter, can 
be inserted which imparts such characteristics, such as 
repressibility or inducibility. For example, if it is 
known that the cell type contains nuclear steroid 
receptors, such as estrogen, testosterone or 
glucocorticoid, or thyroxin receptors, one could use the 
steroid or thyroxin response elements as region F. Such a 
response element is any DNA which binds such receptor to 
elicit a positive response relative to transcription. Even 
if a cell is not naturally responsive to glucocorticoids, 
for example, a piece of DNA which encodes the 
glucocorticoid receptor could be added to the construct, or 
otherwise inserted somewhere in the genome, so as to make 
the cell responsive to glucocorticoids. The use of a 
regulatable promoter could be desirable whether or not the 
gene of interest is normally transcriptionally silent. 
Other kinds of regulation can also be obtained by targeting 
the appropriate DNA regulatory segment to the exact 
position of interest by means of the process of the present 
invention. 

Thus, while stimulation of expression of normally 
transcriptionally silent genes is the preferred application 
of the present invention, in its broadest sense it is 
applicable to the modification of expression 
characteristics of any gene endogenous to the host cell 
line. 

The specific technique of homologous 
recombination is not, per se, a novel part of the present 
invention. Such techniques are known and those of ordinary 
skill in this art will understand that any such technique 
can be used in the present invention as long as it permits 
targeting of the DNA regulatory sequence to the desired 
location with respect to the gene of interest. While a 
preferred technique is disclosed, using a linearized 
construct with two homologous regions on either end of the 
sequences to be inserted, any other technique which will 
accomplish this function, as, for example, by using 
circular constructs, is also intended to be comprehended by 
the present invention. The critical feature of the present 
invention is the use of homologous recombination techniques 
to insert a DNA regulatory sequence which causes 
modification of expression characteristics in the cell line 
or microorganism being used, operatively linked with a gene 
in the genome of the cell line, preferably one which is 
normally transcriptionally silent, or to insert an 
amplifiable sequence, without a regulatory sequence, 
sufficiently near a gene in the genome of the cell line 
which already transcribes as to cause amplification of such 
gene upon amplification of the amplifiable sequence. It is 
not absolutely necessary that a selectable marker also be 
included. Selection can be based solely on detection of 
the gene product of interest or mRNAs in the media or cells 
following insertion of the DNA construct. Furthermore, in 
the embodiment in which a regulatory sequence is being 
inserted, amplification, while desired, is not critical for 
operability. The same is true for the negative selection 
gene which makes the screening process easier, but is not 
critical for the success of the invention. Thus, the basic 
embodiment requires only insertion of the DNA regulatory 
segment or the amplifiable segment in the specific position 
desired. However, the addition of positive and/or negative 
selectable marker genes for use in the selection technique 
is preferred, as is the addition of an amplifiable gene in 
the embodiment in which a regulatory segment is being 
added. 
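
The targeting principle described above, a construct carrying 
two homologous regions on either end of the sequences to be 
inserted, can be illustrated with a deliberately simplified 
string model in Python. This toy model is an assumption-laden 
sketch and not a description of the biological mechanism: the 
genome is treated as a plain string, the homology arms must 
match exactly, and all names and sequences are hypothetical. 

```python
def targeted_insertion(genome, upstream_arm, payload, downstream_arm):
    """Replace the genomic segment between two homology arms with `payload`.

    Returns the modified sequence, or None if either arm is not found
    (no targeted event in this toy model).
    """
    left = genome.find(upstream_arm)
    if left == -1:
        return None
    left_end = left + len(upstream_arm)
    right = genome.find(downstream_arm, left_end)
    if right == -1:
        return None
    return genome[:left_end] + payload + genome[right:]


# Hypothetical sequences for illustration only
genome = "TTTT" + "ACGTAC" + "GG" + "CATCAT" + "ATGAAATAA"
result = targeted_insertion(genome, "ACGTAC", "[PROMOTER][NEO]", "CATCAT")
```

In this sketch the payload stands in for the regulatory 
segment and any selectable marker, and it lands at a position 
fixed by the two arms rather than at a random site. 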

The term "modification of expression" as used 
throughout the present specification and claims, is hereby 
defined as excluding termination of expression by inserting 
by homologous recombination a mutation, deletion, stop 
codon, or other nucleotide sequence, including an entire 
gene, into the gene of interest, so as to prevent the 
product of interest from being expressed. The prior art 
teaches the use of homologous recombination to insert 
specific mutations and the expression of a cell product may 
have inherently been terminated by means thereof (see, for 
example, Schwartzberg et al, PNAS (USA), 87:3210-3214 
(1990)). The present invention is not intended to 
encompass such a procedure. In the present invention the 
"modification of expression" is accomplished by means of 
inserting regulatory and/or amplification regions at a 
specific desired location by means of homologous 
recombination. The preferred modifications are those which 
activate and/or enhance expression of the product of 
interest. 

Whenever the present specification uses the 
phrase that a DNA regulatory segment is "operatively linked 
with" a gene, such terminology is intended to mean that the 
DNA regulatory segment is so disposed with respect to the 
gene of interest that transcription of such gene is 
regulated by that DNA regulatory segment. The regulatory 
segment is preferably upstream of the gene, but may be 
downstream or within the gene, provided that it operates to 
regulate expression of the gene in some way. The DNA 
regulatory segment may be a promoter, terminator, operator, 
enhancer, silencer, attenuator, or the like, or any 
combination thereof. 

Whenever the terms "upstream" or "downstream" are 
used in the present specification and claims, this is 
intended to mean in the 5'-direction or the 3'-direction, 
respectively, relative to the coding strand of the gene of 
interest. 

The foregoing description of the specific 
embodiments so fully reveals the general nature of the 
invention that others can readily modify and/or adapt such 
specific embodiments for various applications without 
departing from the generic concept. Any such adaptations 
and modifications are intended to be embraced within the 
meaning and range of equivalents of the disclosed 
embodiments. It is to be understood that the phraseology 
and terminology employed herein are for the purpose of 
description and not of limitation. 

WHAT IS CLAIMED IS: 

1. A method of activating a normally 
transcriptionally silent gene within the genome of a cell 
line or microorganism so as to enable said cell line or 
microorganism to express the gene product of said gene, 
comprising inserting a DNA construct into said genome by 
homologous recombination, said DNA construct comprising a 
DNA regulatory segment capable of stimulating expression of 
said gene when operatively linked thereto and a DNA 
targeting segment homologous to a region of said genome 
within or proximal to said gene, wherein said construct is 
inserted such that said regulatory segment is operatively 
linked to said gene of interest. 

2. A method in accordance with claim 1, 21, or 
22, wherein said DNA construct comprises two DNA targeting 
segments, each homologous to a region of said genome within 
or proximate to said gene, one of said targeting segments 
being upstream of said regulatory segment and the other of 
said targeting segments being downstream of said regulatory 
segment. 

3. A method in accordance with claim 1, 2, or 
21, wherein said DNA construct additionally comprises at 
least one expressible selectable marker gene disposed so as 
to be inserted with said regulatory segment. 

4. A method in accordance with claim 1, 2, 3, 
21, 22, or 23, wherein said DNA construct additionally 
comprises a negative selectable marker gene disposed with 
respect to said targeting segment so as not to be inserted 
when said construct is properly inserted by homologous 
recombination, whereby said negative selectable marker is 
not expressed in cells in which said DNA construct is 
properly inserted. 

5. A method in accordance with claim 1, 2, 3, 4, 
or 21, wherein said DNA construct additionally comprises an 
expressible amplifiable gene disposed so as to be inserted 
with said regulatory segment. 



WO 91/09955    PCT/US90/07642 

6. A method in accordance with claim 1, 2, 3, 4, 
5, 21, 22, or 23, wherein said cell line or microorganism 
is a eukaryotic cell line. 

7. A method in accordance with claim 6, wherein 
said cell line or microorganism is an animal cell line. 

8. A method in accordance with claim 6, wherein 
said cell line or microorganism is a mammalian cell line. 

9. A method in accordance with claim 6, wherein 
said cell line or microorganism is a plant cell line. 

10. A method in accordance with claim 3, and 
additionally for causing expression of said gene product, 
further including the steps of, following said inserting 
step: 

selecting clones of said cell line or 
microorganism which express the product of said selectable 
marker gene; 

cultivating the selected clones under conditions 
sufficient to permit expression of said gene product; and 

collecting said gene product. 

11. A method in accordance with claim 10, 
wherein said selectable marker gene is the neomycin 
resistance gene and said selecting step comprises selecting 
those clones having neomycin resistance. 

12. A method in accordance with claim 10 or 11, 
wherein said DNA construct additionally comprises a 
negative selectable marker gene disposed with respect to 
said targeting segment so as not to be inserted when said 
construct is properly inserted by homologous recombination, 
whereby said negative selectable marker is not expressed in 
cells in which said DNA construct is properly inserted, and 
said selecting step further includes selecting those clones 
which do not express said negative selectable marker gene. 

13. A method in accordance with claim 12, 
wherein said negative selectable marker gene is the Herpes 
Simplex Virus thymidine kinase gene and said selecting step 
includes selecting those clones which survive exposure to a 
media that kills cells which express said gene. 

14. A genome having a DNA regulatory segment 
operatively linked with a naturally occurring gene at an 
insertion site characterized by a predetermined DNA 
sequence, said DNA regulatory segment not being naturally 
occurring at said location in the genome. 

15. A cell line or microorganism capable of 
expressing a gene product by a normally transcriptionally 
silent gene within the genome of said cell line or 
microorganism, said genome having inserted therein a DNA 
regulatory segment operatively linked with said normally 
transcriptionally silent gene, said DNA regulatory segment 
being capable of promoting the expression of a gene product 
by said cell line or microorganism. 

16. A cell line or microorganism in accordance 
with claim 15 or 25, wherein said DNA regulatory segment is 
one which is capable of promoting the expression of a gene 
product normally expressed by said cell line or 
microorganism. 

17. A cell line or microorganism in accordance 
with claim 16, wherein the inserted DNA regulatory segment 
is part of a DNA construct comprising said DNA regulatory 
segment and at least one selectable marker gene. 

18. A cell line or microorganism in accordance 
with claim 17, wherein said DNA construct additionally 
comprises an amplifiable gene. 

19. A method of obtaining a gene product from a 
cell line or microorganism, comprising culturing a 
differentiated cell line or microorganism in accordance 
with claim 15-18 or 24-26 under conditions which permit 
expression of said gene product, and collecting said gene 
product. 

20. A DNA construct for insertion into a 
predetermined host cell line or microorganism, comprising a 
DNA regulatory segment capable of modifying the expression 
characteristics of genes in the host cell line or 
microorganism when operatively linked thereto and a DNA 
targeting segment homologous to a region of the genome of a 
preselected gene within the host cell line or 
microorganism. 

21. A method of modifying the expression 
characteristics of a gene within the genome of a cell line 
or microorganism, comprising inserting a DNA construct into 
said genome by homologous recombination, said DNA construct 
comprising a DNA regulatory segment capable of modifying 
the expression characteristics of said gene when 
operatively linked thereto, as compared to its existing DNA 
regulatory segment, and a DNA targeting segment homologous 
to a region of said genome within or proximal to said gene, 
wherein said construct is inserted such that said 
regulatory segment is operatively linked to said gene of 
interest. 

22. A method of modifying the expression 
characteristics of a gene within the genome of a cell line 
or microorganism, comprising inserting a DNA construct into 
said genome by homologous recombination, said DNA construct 
comprising an expressible, amplifiable gene capable of 
amplifying said gene when inserted in sufficiently close 
proximity thereto, and a DNA targeting segment homologous 
to a region of said genome within or proximal to said gene, 
wherein said construct is inserted such that said 
amplifiable gene is in sufficiently close proximity to said 
gene of interest to cause amplification thereof when said 
amplifiable gene is amplified. 

23. A method in accordance with claim 22, 
wherein said DNA construct additionally comprises at least 
one expressible selectable marker gene disposed so as to be 
inserted with said expressible, amplifiable gene. 

24. A cell line or microorganism capable of 
enhanced expression of a gene product compared to the cell 
line or microorganism from which it is derived, said gene 
product being the expression product of an endogenous gene 
within the genome of said cell, said genome having inserted 
therein in an operative manner, at or near said endogenous 
gene, an exogenous DNA regulatory segment and/or 
amplifiable gene capable of enhancing the expression of 
said gene product by said cell line or microorganism. 
25. A cell line or microorganism in accordance 
with claim 24, wherein said exogenous DNA regulatory 
segment and/or amplifiable gene is an exogenous DNA 
regulatory segment. 

26. A cell line or microorganism in accordance 
with claim 24, wherein said exogenous DNA regulatory 
segment and/or amplifiable gene is an exogenous amplifiable 
gene. 

27. A DNA construct for insertion into a 
predetermined host cell line or microorganism, comprising 
an expressible, amplifiable gene capable of amplifying a 
gene in the host cell line or microorganism when inserted 
in sufficiently close proximity thereto, and a DNA 
targeting segment homologous to a region of the genome of a 
preselected gene within the host cell line or 
microorganism. 

28. A method in accordance with claims 1, 2, 3, 
4, 5, 21, 22, or 23, wherein said cell line or 
microorganism is a microorganism. 


LOCATION OF PRIMERS FOR PCR AMPLIFICATION OF TSH BETA 

[Sequence figure: the TSHB gene region is shown 5' to 3', with 
primer TSHB5 (5' AGTATATGATGTACGTGGACAGG 3') and primer TSHB3 
(shown 3' to 5' as GGACTCGACGTTCACACCGTTCAC) aligned to their 
target sites.] 

EXONS 2 AND 3 ARE IN CAPITAL LETTERS 
247 BP AMPLIFIED FRAGMENT UNDERLINED BY * 